An incremental linear-time learning algorithm for the Optimum-Path Forest classifier

نویسندگان

  • Mateus Riva
  • Moacir Ponti
چکیده

We present a classification method with linear-time incremental capabilities based on the Optimum-Path Forest (OPF) classifier. The OPF considers instances as nodes of a graph where the edges’ weights are the distances between two nodes’ feature vectors. Upon this graph, a minimum spanning tree is built, and every edge connecting instances of different classes is removed, with those nodes becoming prototypes or roots of a tree. A new instance is classified by discovering which tree it would conquer. In this paper we describe a new training algorithm with incremental capabilities to update the model by including new instances into one of the existing trees; substituting the prototype of a tree; or splitting a tree. This incremental method was tested for accuracy and running time against both full retraining using the original OPF and an adaptation of the Differential Image Foresting Transform. The method is able to include a new instance in linear-time, while keeping similar accuracies when compared with the original model, which runs in quadratic-time.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Learning to Identify Non-Technical Losses with Optimum-Path Forest

In this work we have proposed an innovative and accurate solution for non-technical losses identification using the Optimum-Path Forest (OPF) classifier and its learning algorithm. Results in two datasets demonstrated that OPF outperformed the state of the art pattern recognition techniques and OPF with learning achieved better results for automatic nontechnical losses identification than recen...

متن کامل

Supervised Pattern Classification Using Optimum-Path Forest

We present a graph-based framework for pattern recognition, called Optimum-Path Forest (OPF), and describe one of its classifiers developed for the supervised learning case. This classifier does not require parameters and can handle some overlapping among multiple classes with arbitrary shapes. The method reduces the pattern recognition problem into the computation of an optimum-path forest in ...

متن کامل

Application of ensemble learning techniques to model the atmospheric concentration of SO2

In view of pollution prediction modeling, the study adopts homogenous (random forest, bagging, and additive regression) and heterogeneous (voting) ensemble classifiers to predict the atmospheric concentration of Sulphur dioxide. For model validation, results were compared against widely known single base classifiers such as support vector machine, multilayer perceptron, linear regression and re...

متن کامل

Land Use Classification Using Optimum-Path Forest

It was introduced in this paper the Optimum-Path Forest for land use classification aiming a better environmental management, using images obtained from CBERS 2B CCD satellite covering the area of the Rio das Pedras watershed, Itatinga City, São Paulo State, Brazil. We also compared the Optimum-Path Forest algorithm with the well known supervised classifiers: Artificial Neural Networks using Mu...

متن کامل

A Random Forest Classifier based on Genetic Algorithm for Cardiovascular Diseases Diagnosis (RESEARCH NOTE)

Machine learning-based classification techniques provide support for the decision making process in the field of healthcare, especially in disease diagnosis, prognosis and screening. Healthcare datasets are voluminous in nature and their high dimensionality problem comprises in terms of slower learning rate and higher computational cost. Feature selection is expected to deal with the high dimen...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Inf. Process. Lett.

دوره 126  شماره 

صفحات  -

تاریخ انتشار 2017